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ABSTRACT 



Aims. A model-independent reconstruction of the cosmic expansion rate is essential to a robust analysis of cosmological 
^ ' observations. Our goal is to demonstrate that current data are able to provide reasonable constraints on the behavior of the 
Hubble parameter with redshift, independently of any cosmological model or underlying gravity theory. 
CO ' Methods. Using type la supernova data, we show that it is possible to analytically calculate the Fisher matrix components in 

in ■ 

a Hubble parameter analysis without assumptions about the energy content of the Universe. We used a principal component 
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analysis to reconstruct the Hubble parameter as a linear combination of the Fisher matrix eigenvectors (principal components). 
To suppress the bias introduced by the high redshift behavior of the components, we considered the value of the Hubble 
parameter at high redshift as a free parameter. We first tested our procedure using a mock sample of type la supernova 
observations, we then applied it to the real data compiled by the Sloan Digital Sky Survey (SDSS) group. 
Results. In the mock sample analysis, we demonstrate that it is possible to drastically suppress the bias introduced by the high 
redshift behavior of the principal components. Applying our procedure to the real data, we show that it allows us to determine 
the behavior of the Hubble parameter with reasonable uncertainty, without introducing any ad-hoc parameterizations. Beyond 
that, our reconstruction agrees with completely independent measurements of the Hubble parameter obtained from red-envelope 
galaxies. 
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1. Introduction 
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is accelerating (jRiess et al 



1998 



to explain these observa ti ons ( Cunha et al 



Samsing fc Linderl (j2010l ): 



At the end of the 20th century, observations of type la supernovae (SNIa) revealed that the Universe expansion 

9991). Since these pu b lication s . seve r al efforts have been mad e 



Perlmutter et al 



Freaza et al 



2002) 



2009^ 



Frieman et al 



Ishida 



(2008); 



Ishida et al 



Linderl (|2008l ); 



Linder fc Hutererl ( 20051 ); 



2008) and references therein). In a stan- 



dard analysis, dark-energy models are characterized by a small set of parameters. These are placed into the cosmic 
expansion rate by means of the Friedman equations, in substitution for the conventional cosmological-constant term. 
This approach assumes a specific dependence of the dark-energy equation of state (w) on redshift and provides some 
insight into the probable values of the parameters involved. However, the results remain restricted to that particular 
parametrization. An interesting question to attempt to answer is what can be inferred about the cosmic expansion 
rate from observations without any reference to a specific model for the energy content of the Universe? 

To perform an independent analysis, we used principal component analysis (PCA). In simple terms, PCA identifies 
the directions of data points clustering in the phase space defined by the parameters of a given model. Consequ ently, 



it allows a dimensionality reduction with as minimum an information loss as possible (T egmark et al 



1997 ). The 



importance of a model- independent reconst r uction of the cosmic expansion rate has already been investigated i n 



the literature (Huterer fe Turner 



1999 



2000: 



Tegmark 



2002 



Wang fc Tegmark 



2005 



Mignonc & Bartchnann 



In this context, PCA has been used to recon struct the dark-energy equation of state ( Huterer fc Starkman 



Crittenden et al 



2009; 



Simpson fc Bridldl2006h and the deceleration parameter (jShapiro fc Tumeric 006[l as a function 



2008) 



2003 



of red shift. The use of PCA was also proposed in the interpretation of future experiments results 



Kitching fc Amaral (2009) recall that 



by 



Albrecht et al 



3" 



(2009). In the face of growing interest in the application of PCA to cosmology, 
some care must be taken in choosing the basic expansion functions and the interpretation assigned to the components. 

The main goal of this work is to apply PCA to reconstruct directly the Hubble parameter redshift dependence 
without any reference to a specific cosmological model. In this context, the eigenvectors and eigenvalues of the Fisher 
matrix form a new basis in which the Hubble parameter is expanded. For the first time, we show that it is possible 
to derive analytical expressions for the Fisher matrix if we focus on the Hubble parameter {H{z)) as a sum of step 
functions. The reader should realize throughout this work that our procedure is mostly driven by the data, although 
there is a weak dependence of the components on our starting choices of parameter values. In other words, the 
functional form of each eigenvector is not of primary importance, we are more intereste d in how they are l inearl y 



combined. This approach allows us to avoid many interpretation problems pointed out bv lKitching fc Amaral (|2009). 
Our only assumption is that the Universe is spatially homogeneous and isotropic and can be described by Friedmann- 
Robertson- Walker (FRW) metric. 

The paper is organized as follows. In section [3J we briefly review our knowledge of PCA and demonstrate how 
it can be applied to a Hubble parameter analysis using type la supernova observations. Section [3] shows the results 
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obtained with a simulated supernova data set, following the standard procedure for dealing with the linear combination 
coefficients. We demonstrate that the quality of our results derived from the simulated data are greatly improved if we 
consider the Hubble parameter value in the upper redshift bound as a free para meter. We apply the same procedure 
to real type la supernova data compiled by the Sloan Digital Sky Survey team (jKessler et al.l 120091) . The results are 
shown in sectional Finally, in section [SJ we present our conclusions. 



2. Principal component analysis 

2.1. The Fisher matrix 

The procedure used to find the principal components (PCs) begins with the definition of the Fisher information 
matrix (F). Owing to its relation to the covariance matrix (F=C~ 1 ), it can be shown that the PCs and their associated 
uncertainties are related to the eigenvectors and eigenvalues of the Fisher matrix, respectively. 

We consider that our data set is formed by N independent observations, each one characterized by a Gaussian 
probability density function, fi(xi,af, f}). In our notation, Xi represents the i — th measurement, cr, the uncertainty 
associated with it, and ft is the vector of parameters of our theoretical model. In other words, we investigate a specific 
quantity, x, which can be written as a function of the parameters In this context, the likelihood function 

is given by L — YiiLi h an d the Fisher matrix is defined as 



d 2 lnL(0) \ 



(1) 



The brackets in equation (fTJ represents the expectation value. 

We can write F=D T A D, where the rows of the decorrelation matrix (D) are the eigenvectors (ej) of F, and A is a 
diagonal matrix whose non-zero elements are the eigenvalues (Aj) of F. Choosing D to be orthogonal, with det(D)=l, 
et forms an orthonormal basis of decorrelated vectors (or modes). After finding the eigenvectors and eigenvalues of F, 
we rewrite i as a linear combination of e%. Our ability to determine each coefficient of this linear expansion (a^) is 

— 1/2 

given by a ai = X i ' . Following the standard convention, we enumerate ei from the larger to the smaller associated 
eigenvalue. 

The main goal of PCA is the dimensionality reduction of our initial parameter space. This arises in the number 
of PCs we use to rewrite x. The most accurately determined modes (smaller a ai ) correspond to directions of high 
data clustering in the original parameter space. As a consequence, they represent a larger part of the variance present 
in the original data set. In the same way, the most poorly determined modes correspond to a small portion of the 
variance in the data, describing features that might not be important in our particular analysis. In this context, we 
must determine the number of PCs that will be used in the reconstruction. Our decision must be balanced between 
how much information we are willing to discard and the amount of uncertainty that will not compromise our results. 
The constraint on x reconstructed with M modes (where M ^ Npc and Npc is the total number of PCs), is given 



Ishida & de Souza: Hubble parameter reconstruction 



by a simple error propagation of the uncertainties associated with each PC (jHuterer k, Starkman 



2003) 



'PCS 



Npc 

i=i 



M 

i=i 



(2) 



From this expression, it is clear that adding one more PC adds also its associated uncertainty. At this point, we note 
that to calculate F^i we must choose numerical values for each parameter /3j. This corresponds to specifying a base 
model as our starting point. As a consequence, the results provided by PCA are interpreted as deviations from this 
initial model. The uncertainty derived from fitting the data to this base model should also be added in quadrature to 
equation ([2]), to compute the total uncertainty in the final reconstruction. 

The question of how many PCs should be used in the final reconstruction is far from simple, and there is no standard 
quantitative procedure to determine it. In many cases, th e decision dep ends on the particular data set analyzed and 
our expectation towards them (for a complete review see I Jollifd ([20021 ). chapter 6). One practical way of facing the 
problem is to consider how many components are inconsistent with zero in a particular reconstruction. In most cases, 
the coefficients on tend to decrease in modulus for higher i, at the same time as the uncertainties associated with them 
increases. In this context, we can choose the final reconstruction as the one whose coefficients are all inconsistent with 



The determination of one final reconstruction is beyond the scope of this work. However, to provide an idea of how 
much of the initial variance is included in our plots, we shall order them following their cumulative percentage of total 
variance. 

The total variance present in the data is represented well by the sum of all Xi, and a reconstruction with the first 
M PCs encloses a percentage of this value (£m)> given by 



t M = 100 



ENpc \ 



(3) 



As a consequence, the question of how many PCs turns into a matter of what percentage of total variance we are 
willing to enclose. 



2.2. Investigating the Hubble parameter from SNIa observations 

From now on, we consider the distance modulus, fi, provided by type la supernova observations as our observed 
quantity (xi — fii). In a very simple approach, if we consider a flat, homogeneous and isotropic Universe, described by 
the FRW metric, the distance modulus relates to cosmology according to 



fi(z) = 51og 10 [d L (z)} + no, 
du 

d L {z) = (1 + z) 



(4) 
(5) 



/o H( u y 

where [j,q is called intercept, d^(z) is the luminosity distanc e, and H(z) is the H ubble parameter. We use Hq = 72 



km s 1 Mpc 1 as the current value of the Hubble parameter ([Komatsu et al 



2009) 
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To make H{z) as general as possible, we write it as a sum of step functions 

N bi n 



(6) 



Ci{z) = 



where 

1 if (t - l)Az < z < iAz 
otherwise 

Pi are constants, /? is the vector formed by all Pi , Nu n is the number of redshift bins and Az is the width of each bin. 
This approach was proposed bv lShapiro fc Turner! (|2006T ) in the context of deceleration parameter analysis. Although, 
when it is used for the Hubble parameter, the Fisher matrix calculations are simplified and we still get pretty general 
results. Given the definition above, Pi are now the parameters of our theory. Physically, they represent the value of 
the Hubble parameter in each redshift bin. We can obviously express any functional form using this prescription, with 
higher resolution for a larger number of bins. 

In this context, we are able to obtain analytical expressions for the luminosity distance 

L{z 



and its derivatives 
dd L (z,fi) 



dp k 



-(1 + *) 



L{z) 



d L (z,fi) = (l + z) 



z - L(z)Az 



(7) 



+ 



o 

h(z)+i,k (z ~ L(z)Az) 



^l(z) + l 



(8) 



where L(z) corresponds to the integer part of zj Az. From equations ([T]), (j4]), and ([5]), we can calculate the Fisher 
matrix components as 



Fbi — 



25 



(lnlO) 2 



1 



N 

V — 

i=1 (vdata z d L {Zi;0)y 



dd L (zi,P) dd L (zi,f}) 



(9) 



dp k dp t 

The Hubble parameter may now be reconstructed as the sum of H^ase and a linear combination of the new 

uncorrelated variables represented by the eigenvectors of the Fisher matrix. Mathematically, 

M 

H rec (z;a) = H base (z) +^aiei(z), (10) 
t=i 

where oti are constants and a is the vector formed by all the a*. Using equation (|10p in equations @ and ([5]), we can 
write the reconstructed distance modulus. The data set is then used to find values for the parameters on that minimize 
the expression 



(jM - Li re c(zi;a)) 2 
2a? 



(11) 



This minimization procedure will also generate an uncertainty in the value of parameters on (c™ m ), which should be 
taken into account in the final reconstruction error budget. 
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3. Application 

3.1. Mock sample 

To test the expressions and procedures presented before, we used a simulated type la supernova data set. We consider 
34 redshift bins of Az = 0.05 (0 < z < 1.7), each one containing 50 supernovae. We tested configurations with a larger 
number of bins, but the results are consistent for any configuration with m ore than ~ 2 5 re dshift bins. The uncertainty 



in the i — th bin was calculated according to the prescription proposed in lKim et al.1 (2004) 



/0.15 2 / Az(z-0.5)\ 2 

We performed 1000 simulations of a flat Universe containing a cosmological constant and dark matter, with matter 
density parameter VL m = 0.27 as our fiducial model. 

Our main goal in using this simulation is to obtain an idea of how the procedure proposed here behaves in an 
almost ideal scenario. It represents a simplified version of future data, as for the Joint Dark Energy Mission (JDEM)h 
but it is enough to allow us to check the consistency of our procedure. 

Using the equations shown in the previous section, we calculated the Fisher matrix components. We found that 
the modes are weakly sensitive to the choice of initial base model (values for the parameters ft, hereafter Hb ase ). 
However, if we use a specific cosmological model to attribute values to the parameters ft (ACDM, for example), all 
the results derived from this initial choice can only be analyzed in the face of that model. As our goal is to make 
a model-independent analysis, the best choice is to calculate the PCs based on a model where there is no evolution 
with redshift (ft = etc > 0,Vi). The PCs will then denote deviations from a constant behavior, regardless of the 
value attributed to ft. A constant Hubble parameter is obviously an extremely unrealistic model, although, it does 
allow us to have a better idea of which characteristics of our results are extracted from the data and which are only a 
consequence of our initial choices. 

We do not currently have well constrained information about the e volution of the Hubble parameter with redshift, 
but we do have independent measurements of its value today, H (e.g. iKomatsu et al.l (|2009l) ). Hence, we present our 
results in units of Hq and use a base model in which ft = 1, \/i. The resulting eigenvectors with larger eigenvalues are 
shown in Fig. Q] 

The comparison between the PCs obtained from using Hb ase = ACDM and H{, a se = cte is shown in figure @. 
From this plot, we can see that the difference exist, but the overall shape of the PCs are not very sensitive to the 
choice of H base . 

To clear illustrate the standard-procedure results of PCA reconstruction in the specific case studied here, we show 
in Fig. [3] reconstructions using one to six PCs with corresponding values of tM- From this plot, it is clear that our 
attempt to reconstruct H(z) using a few PCs does not provide the expected results. We have two main problems here: 



1 http://jdem.lbl.gov/ 
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Fig. 1. PCs obtained from our mock sample as a function of redshift. All PCs are shown according to the convention 
that ti(z = 0) > 0. Top: First (red- full), second (blue-dashed), third (green-dotdashed), and fourth (brown-dotted) 
PC. Bottom: Fifth (purple- full) , sixth (cyan-dashed), seventh (magenta-dotdashed) , and eighth (dark blue-dotted) 
PC. 



the reconstruction merely oscillates around the fiducial model (blue-dashed line) and we can clearly see that there is 
a bias dominating the high-redshift behavior. We address both problems in the next section. 
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Fig. 2. Principal components obtained using a constant base model (full-red line) and using ACDM as a base model 
(dotted-brown line). 
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Fig. 3. Reconstruction of the Hubble parameter using equations ([TU|) and (p~Tj) . Hubble parameter reconstructed with 
one (top- left) to six PCs (bottom- right), in units of Hq. The black (solid) line represents our best- fit reconstruction 
and the red (dotted) curves shows 2a confidence levels. The blue (dashed) line corresponds to the the behavior of the 
Hubble parameter in our fiducial model. 

3.2. Minimizing the bias in the reconstruction 

From Fig. [TJ we realize that all the first eight PCs go to zero at high-redshift, which means that at these redshifts 
our data provides little information [^|. Consequently, no matter how many PCs we use or which values we attribute 



This kind of behav ior is also present 



deceleration parameter (|Shapiro fc Turner 



in P Cs from the dark energy equation of state IjHuterer fc Starkmanl I2003T ) and 



2006). 
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to the parameters at , the reconstructed function will always be biased in the direction of our previously chosen base 
model for high z (in this work, "high z" corresponds to the upper redshift bound of our data set. In our mock sample, 

Zynax 1.7). 

At this point, we must pay attention to the crucial role played by Hbase in the standard procedure described so far. 
Although the PCs depend weakly on our choice of Hbase, the final reconstruction is extremely sensitive to the choice. 
Figure (@| shows how different Hbase lead to completely different final reconstructions. 

Searching the literature, we found two d ifferent approaches to dealing with this problem. W e could ignore the 



reconstruction in the region of high redshift (jHuterer fc Starkman 



2003 



Shapiro fc Turner 



2006T) or add a physically 



motivated model for Hbase in equation (|10[) . which would provide us with the value we expect to measure in the 
upper redshift bound (jTang et al.l 120081) . We consider that the first alternative does not represent a good solution. 
The problem is not only the bias at high z, but also the weird behavior present in the reconstruction as a whole. 
Beyond that, our intention is not only to improve the fit quality, but also to make it independent of our initial choice 
of Hbase- The second alternative would produce results in good agreement with the fiducial model (corresponding to 
the dotted-brown reconstruction in figure in a simulated situation. Defining a physically motivated Hbase would 
only, however, introduce another bias. As in reality we do not have access to the "true" value of H(z), this would 
require us to make a hypothesis about the energy content and dark energy model, which we are trying to avoid. 

In this context, we believe that it is reasonable to consider the behavior of H(z) at high z as a free parameter. 
This means adding another parameter {h Zmax ) to equation (fTU|) . which becomes 

N 

Hrec(z) = K max + H base (z) + ^a. t ei(z). (13) 



As a consequence, the new % 2 will be given by 



N 



x\h z 



(jj,i - (J-recizf, h Zmaa: ,a)) 2 



i=l 



2a 



(14) 



and the uncertainty associated with the determination of h Zmax (a nz 
of equation @, leading to 



is added in quadrature to the right hand side 



Vrec( Z ) 



M 

E 

i=l 



(15) 



The effect of this choice is shown in figure §5§. We can see that, no matter which Hbase we use, if h Zmam is consider to 
be a free parameter, the reconstruction at high z is driven by the data. As a consequence, we obtain good agreement 
for the reconstruction using a constant as well as a ACDM model for Hbase- Even though the PCs are not identical 
for non-evolving and ACDM models (figure Q), this agreement is a direct consequence of the best-fit values of the 
set of parameters {a^, h Zmax } always arranging themselves to more accurately describe the information in the data. 

We present in Fig. [5] the results of the reconstruction using 1 to 5, and 10 PCs, and corresponding values of 
Im- Comparing these results with those in Fig. [31 we can see a huge improvement in the agreement between the 
reconstructed function and the fiducial model. The reconstruction with 10 PCs encloses the fiducial model within 2a 



Ishida & de Souza: Hubble parameter reconstruction 11 

confidence levels in the whole redshift range covered by the data. Considering that initially we had 34 parameters 0i, 
this represents a reduction of ~ 70% in the parameter space dimensionality. 

We have so far demonstrated that PCA is an effective method for determining the Hubble parameter behavior with 
redshift. It provides a considerable reduction in the initial parameter space dimensionality, without introducing any 
hypothesis about the energy content, cosmological model, or underlying gravity theory. The reconstruction relies on 
the assumption of a homogeneous and isotropic Universe, described by a FRW metric. The simulated data set used 
above is composed of independent data points, each one associated with a Gaussian probability density function, in a 
flat ACDM Universe. 

In what follows, we apply this procedure to a real (and consequently less well behaved) data set. Our goal is to see 
whether, in a realistic scenario, the effectiveness of the procedure remains. 



4. Results from current SNIa data 



We applied this procedure to real supernova la data compiled by the Sl oan Digital Sky Su rvey (SDSS) SN group, 



hereafter real sample. This sample include mea surements from the SDSS (jKessler et al 



( Miknaitis et al 



2007 



Wood-Vasev et al 



Space Telescope (FIS T) (IGarnavich et al 



SN la measurements ( Jha et al 



1998; 



2009) , the ESSE NCE survey 
2007 ), the Supernova Legacy Surve y iS. VLSIdAstier et al.LoOrl . the Hubble 



Knop et al 



2003; 



Riess et al 



2004, 



20071) , and a compilation of nearby 



2007). The first eight PCs found from the real data set are shown in Fig. [3 We used 



28 redshift bins of width Az = 0.05 and the 287 data points from the aforementioned data set with z < 1.4. The 
resulting reconstructions with one to six PCs are shown in Fig. [5J The blue-dashed line corresponds to the behavior of 
the Hubble parameter in a flat Universe containing dark energy with an equation-of-state parameter for dark energy 
Wd ark = —0.76 a nd m atter density parameter Q m = 0.30. This corresponds to the be st-fit, flat cosm ology found 



by 



Kessler et al 



(2009) in the context of the Multicolor Light Curve Shape (MLCS2k2, (|Jha et al J 120071) ). hereafter 
fXCDM. It is shown here exclusively for comparison reasons, this model was not used in our calculations. 

Comparing Figs. [6] and we realize that the confidence intervals are much larger in Fig. El a s expected, because of 
the observational uncertainties that are present only in the real sample. Beyond that, the contours corresponding to 2a 
confidence levels do not evolve in Fig. [5] as they do in Fig. [BJ This is a direct consequence of our choice of introducing 
h Zmam as a free parameter. In the simulated case, 07^ is much smaller than the uncertainty associated with the 
parameters at. As a consequence, the evolution of the confidence levels is dominated by the uncertainties associated 
with the PCs. In the real case, the opposite situation occurs. For the six cases presented in Fig. [51 07^ ^ o- a , 
making the final uncertainty almost independent of how many PCs are used in the reconstruction. This behavior is a 
consequence of the low number and quality of data points at high redshift. However, it is also related to a non-null 
correlation between the uncertainties in determining the parameters a, and h Zmax . To explore this method in the best 
case scenario, we need to ensure not only that we have high number and quality of data points at high redshift, but 
also that the PCs are as well determined as possible. 
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In comparing Figs. [5] and [5J the reader should be aware that the blue dashed line means different things in each 
figure. Figure [6] is a simulation, and in such a case the blue dashed line corresponds to the fiducial model used to 
generate our mock sample. Fig. [51 was created u s ing re al data, in this case the blue dashed line represents the best- fit 



Kessler et al 



(2009) 



flat ACDM model, as reported by 

We can also see, in Fig. [HJ that the reconstruction becomes more irregular when more than four PCs are used. If 
we take the blue dashed line as a good representation of the "real" cosmological model, we could say that four PCs are 
enough to enclose the desired behavior within 2a confidence level. For the sake of completeness, we plot reconstructions 
up to six PCs. 

To compare our results with other model-independent determinations of H(z), we plot in the top panel of Fig. |9]thc 
re construction with f our PCs, superimposed on measurements of H(z) derived from red-envelope galaxies observations 



by 



Stern et al 



(|2010f) . The error bars associated with these data points are still pretty large, but they already provide 
important insights into the behavior of the Hubble parameter in the redshift range covered by the real data sample. 
We can see that the reconstruction encloses the fXCDM model, as well as agreeing with the red-envelope galaxy 
measurements. To compare our results with the predictions of a standard model-dependent procedure, we show in the 
bottom panel of Fig. El the 2a confide nce levels derived from the error propagation of statistical uncertainties in VL m 



and w reported by 



Kessler et al 



(2009) assuming a fXCDM model and using MLCS2k2. We again find good agreement 
between the two results. 

In Figs. |S]and [9J we point out that the blue-dashed line does not represent the behavior we are trying to achieve, 
but is merely a representation of a model we are used to dealing with. The purpose of showing it here is to provide 
an idea of how far our results are from others presented in the literature, although, in our particular analysis, no 
assumptions about the energy content of the Universe is necessary. 

The determination of what kind of physical and/or systematic effect generates patterns seen in the two lower 
panels of Fig. [8] is beyond the scope of this work. In our interpretation, these results confirm that fXCDM provides a 
good first-order approximation of the real behavior of H(z) within the current observational errors and assumptions 
underlying our procedure. However, a more realistic simulation and detailed study of systematic errors are necessary 
in order to fully understand second-order effects. 



5. Conclusions 

We have presented an alternative procedure for extracting cosmological parameters from type la supernova data. Our 
analysis is concentrated in the Hubble parameter, although we emphasize that the same procedure can be applied to 
other quantities of interest. Our goal has been to be as general as possible, so we have tried to avoid parametric forms 
or specific cosmological models by using PCA. 

Writing H(z) according to equation (j5]) and considering type la supernova observations, we have shown that it is 
possible to obtain analytical expressions for the Fisher matrix. We used a mock sample formed b y 34 redshift bins of 



width Az = 0.05, with errors calculated following the prescription proposed by 



Kim et al 



( 20041 ). This mock sample 
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represents a simplification of future data sets, such as the JDEM, and is not a realistic representation of current data. 
Our goal in using it was to check the consistentency of our procedure. 

Our first attempt in reconstructing the Hubble parameter as a linear combination of the eigenvectors of F was 
unsuccessful. In trying to fit high-redshift data with PCs that go asymptotically to zero, the most oscillatory modes 
propagate their behavior to the reconstructed H(z) in the whole redshift range. As a consequence, the final result 
barely resembles our fiducial model. 

To suppress the influence of the high-redshift behavior present in all PCs of interest, we considered the value of 
the Hubble parameter at high redshift as an extra free parameter in our analysis. This simple modification provided 
reliable results when used with simulated a nd real supernova data. Beyond that, our results are corroborated with 



measurements of red-envelope galaxies from IStern et al.l (|2010l ) 



As a final remark, we emphasize that PCA provides a viable way of avoiding phenomenological parameterizations. 
It represents one of the few statistical methods that allow us to obtain the behavior of a chosen quantity directly 
from the data. It has its own assumptions, such as Gaussianity, independence of data points and in the specific case 
analyzed here, cosmologies that obey a FRW metric. In the final reconstruction phase, it also exhibits a bias in the 
upper redshift bound. On the other hand, the procedure proposed here can drastically suppress the influence of this 
bias. Beyond that, we show that in the context of this work, the Fisher matrix can be analytically obtained. This 
avoids all uncertainties related to numerical derivations of step functions and might be a good alternative to standard 
statistical analyses applied to cosmological data. 
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Fig. 4. Reconstructions using 1 (top- left) to 6 (right-bottom) PCs for different Hb ase , obtained from our mock sample. 
The black (full) line represents the fiducial model ACDM, the brown (dotted) line corresponds to the final reconstruc- 
tions in the case Hb a se{z) = ACDM, the red (full) line corresponds to the case Ht aS e(z) — 0.5, the blue (dashed) 
line shows the reconstruction considering Hb ase — 1.0, and the green (dot-dashed) line is the reconstruction for 
H base (z) = 4.0. 
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Fig. 5. Reconstructions using one (top-left) to six (right-bottom) PCs for different choices of Hb ase and considering 
h Zmans as a free parameter, for our mock sample. The black (full) line represents the fiducial model ACDM, the blue 
(dashed) line corresponds to the final reconstructions in the case Hb ase (z) = Hkcdm and the red (full) line corresponds 
to the case Hb ase (z) = cte > 0. 
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Fig. 6. Reconstruction of the Hubble parameter using equations (|13p and (|14p . Hubble parameter reconstructed with 
1 (top- left) to 5 (bottom- left) and 10 (bottom-right) PCs, in units of H n . The color code is the same used in figure 
©■ 
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Fig. 7. Principal components obtained from real data sample. All PCs are plotted following the convention eiiz — 
0) > 0. Top: First (red-full), second (blue-dashed), third (green-dotdashed) , and fourth (brown-dotted) PCs. Bottom: 
Fifth (purple- full), sixth (cyan-dashed), seventh (magenta-dotdashed) , and eight (darker blue-dotted) PCs. 
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Fig. 9. Top: Reconstruction of the Huble paramet er using four PCs, obtained from the real sample, superimposed on 



direct Hubble parameter measurements reported by 



Stern et al 



( 20101 ) (green dots). Bottom: Reconstruction of H{z) 



with four P Cs and confidence in tervals of 2a obtained from the propagation of statistical uncertainties on Q m and w 

( 20091 ). for MLCS2k2 results. In both panels, the blue dashed line represents H(z) in a flat 



reported by 



Kessler et al 



XCDM cosmological model with w = —0.76 and ft m — 0.30 (best fit results reported by 



Kessler et al 



(|2009j)). Black 



curve corresponds to the best-fit reconstruction using 4 PCs and red lines are 2a confidence levels. 



